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METHOD AND APPARATUS FOR MULTIMEDIA EDITING 

Field of the Invention 

The present invention relates to multimedia and video and audio editing and, in 
particular: to the method of automated or semi-automated production of multimedia, 
video or audio from previously-recorded input content through the application of 
templates; and also to the method of directing, controlling or otherwise affecting the 
application of templates and production of multimedia, video or audio through use of 
information about the input content. 

Background to the Invention 

Techniques and tools exist for the editing, post-production and also creation of 
multimedia and video and audio productions or presentations. These techniques and 
tools have traditionally developed in the movie and video industries where sufficient 
finances and expertise have allowed and directed development of highly flexible tools 
but which require considerable planning and expertise and often multi-disciplinary 
expertise in order to complete a production at all, let alone to a standard level of 
quality. 

Over time these tools have been simplified and reduced in capability and cost 
and several examples are now available in the consumer and hobbyist marketplace, 
typically for use on home computers and often requiring significant investment in 
computer storage, system performance, accelerator or rendering hardware and the like. 
Typically, any one tool is insufficient to complete a product or to complete a production 
to the required standard, therefore requiring investment in several tools. Furthermore, 
these tools are configured to require sufficient expertise to understand them and there is 
also a requirement to learn how to use the techniques. That is, the user must have or 
gain some expertise in the various disciplines within multimedia, video and audio post- 
production. The state-of-the-art tools do not typically provide such expertise. 
Furthermore, there is a known requirement for collaboration of the multi-disciplined 
team tasked with creating a multimedia, video or audio production. Said collaboration 
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is typically a complex process and those unskilled in the art but wishing to create such a 
production find it difficult or impossible to achieve. 

It is an object of the present invention to ameliorate one or more disadvantages 
of the prior art. 

Summary of the Invention 

According to a first aspect of the invention, there is provided an apparatus, 
system and/or method for containing, applying or providing expertise or assistance, 
including collaborative, multi-disciplinary expertise or assistance in the application of 
multimedia, video and audio production and post-production techniques. 

According to a second aspect of the invention, there is provided an apparatus, 
system and/or method for guiding or modifying production, post-production according 
to the information contained within or describing the input content. 

According to a third aspect of the invention, there is provided a method for 
production and post-production processing of multi-media input data the method 
comprising the steps of: 

(a) inputting one or more multi-media input data sets; 

(b) inputting one or more templates; and 

(c) applying the one or more templates to the one or more input data sets 
so to produce a processed output data set for storage, and/or display and/or further 
processing. 

Brief Description of the Drawings 

A number of preferred embodiments of the present invention will now be 
described with reference to the drawings in which: 

Fig. 1 depicts a typical application of derived movie-making techniques; 
Fig. 2 shows a first example of a temporal structure mapping process; 
Fig. 3 show a second example of a temporal structure mapping process; 
Fig. 4 depicts mapping process steps in more detail; 
Fig. 5 illustrates an application relating to post production processing; 
Fig. 6 illustrates incorporation of user- interaction; 
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Fig. 7 presents a pseudo-code representation of a movie director module; 
Fig. 8 presents a pseudo-code representation of a movie builder example; 
Fig. 9 illustrates a typical template in pseudo-code for an action movie; 
Fig. 10 depicts a preferred embodiment of apparatus upon which the multi- 
media editing processes may be practiced; 

Table 1 presents preferred examples of the selection and extraction process; 
Table 2 illustrates preferred examples for the ordering process; 
Table 3 presents preferred examples for the assembly process; 
Table 4 illustrates examples of effects mapping; 
Table 5 depicts a template for a silent movie; 

Table 6 illustrates associations between editing and effect techniques and 
template type. 

Detailed Description 

First Preferred Embodiment of the Method 

Some of the typically poor features of consumer video, that are typically 
visible or obvious or encountered during presentation of said consumer video, may be 
reduced in effect or partially counteracted by automatic application of techniques 
derived from, or substituting for, typical movie-making or video-making techniques. 
These derived, or substitute, techniques can include techniques that are relatively 
unsophisticated compared to those typically applied in the movie-making industry. 
Furthermore, these relatively unsophisticated techniques, upon application to a 
consumer video or multimedia recording or presentation can provide a positive benefit 
to the said video recording or multimedia presentation, or parts thereof. 

Fig. 1 indicates an example system for automatic application of derived or 
substitute movie-making techniques to an input source content, typically a consumer 
audio/video recording or multimedia recording or presentation. Said derived or 
substitute movie-making techniques are, or may be typically applied in two sequential 
steps to the said input source content, as shown in Fig. 1, resulting in the output of 
processed content that is presented or played or stored in preference to or in 
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replacement of the said input source content. The intention of the embodiment is to 
provide a user with an improved presentation or recording by offering the user the 
possibility of using the output (processed) content in substitution for the input source 
content. Said improvements may include or may only include reductions in poor quality 
features of the said input source content. 

It is the intention of the embodiment to operate on or process the provided 
input source content and not to require or force the user to provide or source or record 
additional or replacement or improved input source content in order to effect the quality 
improvement or reduction in poor features claimed for the invention. The embodiment 
is, however, not restricted from utilising other, or additional, or replacement input 
source content or other content sources in order to achieve the stated goal of 
improvement of quality or reduction of poor features in the output content, in 
comparison with the input source content. 

The process indicated in Fig. 1 includes steps: 101, input of source content; 
102, automatic application of temporal structure mapping; 103, automatic application of 
effects mapping; and 104, output of processed content. 

Fig. 1 step 101 may involve any reasonable method known in the art for input 
or capture of source content into a form suitable for subsequent processing by an 
automatic hardware or software or combined system, typically a general purpose 
computer system as described below with reference to Fig. 10. Such methods may 
include digitisation of analogue data, or may include reading a digitised serial data 
stream, for instance, sourced by a Camcorder or DVCamcorder device, and may 
include format conversion or compression or decompression, or combinations thereof 
as well as, typically, writing of the digitised, converted or reformatted data. 111, to a 
suitable storage medium within the aforementioned automatic system ready for 
subsequent processing at 102. 

Fig. 1 step 102 involves the optional mapping of the temporal structure, or 
part thereof, of the stored input source content to a new temporal structure, or part 
thereof, output at 112. Step 102 is a processing step involving modification of the 
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temporal structure of said input content, 111, to obtain said output content, 112, where 
said mapping process may include the reduced case of identity mapping (that is, no 
changes are introduced). Typically, more useful mapping processes, singular, or 
plural, may be involved in Fig. 1 step 102 and these are described herein as both 
preferred embodiments or parts thereof of the invention as well as being examples, 
without restriction, of embodiment of the invention. 

A first example of a useful temporal structure mapping process that may be 
implemented in Fig. 1 step 102, is shown diagrammatically in Fig. 2. This first 
example mapping process involves the reduction of the duration of the content 111, 
from Fig. 1 step 101 when mapped to the output content, 112, as well as a consequent 
overall reduction in the duration of the entire content presentation. The example in 
Fig. 2 assumes a retention of chronological ordering of the input source content. 111, 
when mapped to the output content, 112. The input content comprises one whole 
temporal element, 201, about which little or nothing may be known by the automatic 
system regarding the temporal structure other than, typically, the total duration of the 
content. 111. This content may typically comprise video and audio information 
previously recorded and now provided by the user, as well as possibly including still 
images and other multimedia elements. This content may even have been previously 
processed or even artificially generated, in which case a variety of content types may be 
included. In this first example of the temporal structure mapping process, 250, the 
automatic system may select portions of the input content and reassemble these. The 
important feature of the mapping process in this example is the reduction of overall 
duration of the output content, 112, in comparison with the duration of the input 
content. 111. This automatic reduction of duration of source content can be one of 
several significant techniques for reducing the poor features of said source content or, 
conversely, for increasing the positive perceptual quality of said source content when it 
is presented to viewers. The mapping process, 250, in Fig. 2, in this first example, 
may typically comprise steps of: selection and extraction Fig. 4, 401, of a number of 
content portions, for instance, 261, 262, 263, from 201, which is a timeline 
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representation of input content 111 in Fig. 1; ordering Fig. 4, 402, of content portions, 
261, 262, 263, which in this first example involves retention of the same sequencing or 
chronology of the extracted portions as was present in 201; and assembly Fig. 4, 403, 
of extracted portions 261, 262, 263, to yield the output content, 112, shown in Fig. 2 
5 in a timeline representation, 290. 

More complex mapping processes, 250, are possible, potentially yielding 
better results, or a greater probability of better results than the first example already 
described. For instance, a second example, shown in Fig. 3, may involve more 
knowledge of the temporal structure of the input content. 111, in the mapping process, 

10 250, to yield a better result, 112, or an improved probability of a better result at 112. 
For instance, when the automatic system applies selection and extraction step 401 to the 
input content in Fig. 3, it may have the benefit of some information about the temporal 
structure of the input content. In Fig. 3 an example temporal structure is shown in 
which the input content comprises five consecutive portions, 301, 302, 303, 304, 305, 

15 labelled Clip 1, Clip 2, Clip 3, Clip 4, and Clip 5, respectively. Information 
concerning the duration of these clips may be available with the input content or may be 
measured in standard ways by the automatic system. The selection and extraction step, 
401, now has the opportunity to perform one or more of a variety of functions or 
algorithms utilising this available or measured temporal structure information to select 

20 and extract a portion or portions from the input content. A list of preferred examples 
for selection and extraction step 401 are given in Table 1 and these are provided 
without restriction on the possible methods of performing step 401. A selection and 
extraction step may be obtained from Table 1 by combining any example from each 
colunm, of which, not all combinations need be useful. Step 402 of the mapping 

25 process, 250, may provide a greater variety of ordering methods and/or greater 
predictability or control of ordering methods if access to information about the temporal 
structure of the input content, 111, is available and/or if information about the temporal 
attributes of the selection and extraction process relative to the temporal structure of the 
input content is available. The ordering step, 402, now has the opportunity to perform 
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one or more of a variety of functions or algorithms utilising this available or temporal 
structure information to order portions previously selected and extracted from the input 
content. A selection of preferred examples for ordering step 402 are listed in Table 2 
and these are provided without restriction on the possible methods of performing step 
5 402. Step 403 of the mapping process, 250, may provide a greater variety of assembly 
methods and/or greater predictability or control of assembly methods if access to 
information about the temporal structure of the input content. 111, is available and/or if 
information about the temporal attributes of the selection and extraction process relative 
to the temporal structure of the input content is available and/or if information about the 

10 ordering process relative to the temporal structure of the input content is available. The 
assembly step, 403, now has the opportunity to perform one or more of a variety of 
functions or algorithms or assembly methods utilising this available or temporal 
stmcture information to assemble portions previously selected and extracted from the 
input content and consequently ordered. A selection of preferred examples for 

15 assembly step 403 are listed in Table 3 and these are provided without restriction on the 
possible methods of performing step 403. 

In the simplest of mapping process methods, 250, related or synchronised or 
coincident audio and video data, for example, may be treated similarly. However, 
there are known techniques, some of which may be automated, in the movie-making 

20 and video-making industries for treating audio near video transitions or vice-versa to 
retain or obtain best quality results and these techniques may be employed in mapping 
process 250. 

Following structure mapping, 102, is effects mapping, 103, in Fig. 1. The 
output content, 112, from the structure mapping process, 102, has effect mapping 
25 performed automatically on it, resulting in output content, 113. In the simplest case, 
effects mapping, 103, may be the identity case, in which the input content, 112, is 
unchanged and output at 113. Typically, however, one or more of a variety of effects 
may be automatically applied at 103, to either or both the audio and video content, for 
example, within content 112. These effects may include processes or functions or 
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algorithms well-known in the art and table 4 provides an example list of effects. A 
variance in the order in which effects are applied to the same content typically results in 
different output content and therefore, the particular ordering of effects applied to 
content 112, may also be considered an effect. Effects may be applied without 
5 knowledge of the temporal structure mapping process nor of the input content's 
temporal structure at 111, in which case it may be typical to apply an effect uniformly 
to the whole content at 112. Alternatively, some effects may be applied with 
knowledge of the input content's temporal structure, or with knowledge of the temporal 
mapping process at 102, and typically, such effects may be applied to a portion or 

10 portions of the content, 112. 

In the first embodiment, temporal mapping and effects mapping are, or may 
be, applied automatically to input content to produce output content that may have poor 
features reduced or improvement of quality or both for the purpose of improving the 
perceptual experience of a viewer or viewers. The first embodiment describes said 

15 example or examples in which minimal information is available to the embodiment 
about the input content, amounting to information about the content's total duration or 
perhaps information about the content's segmentation and clip duration and sequence 
(or chronology) and without direction or input or control by the user other than to 
select the entirety of the input content for application to the embodiment. Furthermore, 

20 the first embodiment of the invention may not include user control of, or selection of, 
temporal structure mapping functions or parameters nor of effects mapping functions or 
parameters. Further, the specific temporal mapping function or functions and effects 
mapping functions exercised by the embodiment may be automatically selected without 
user control and without the benefit of additional, extra or external information or 

25 analysis concerning the content and yet the embodiment is capable of producing 
successful results as may be perceived by a viewer of the output content, 113. This 
fact is a likely result of the expectations and critical faculties and the like of the viewer 
as applied to the output content. Thus, it may be said that the first embodiment of the 
invention effectively provides a random and largely unrelated set of temporal structure 
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mapping and effects mapping processes for application to input content with some 
probability of the output content being perceived as improved or reduced of poor 
features by a viewer. 

The temporal mapping process and the effects mapping process may be 
5 described as being, or as being part of, or as obeying rules or rule-sets where the rules 
or rule-sets may include these properties or relations or information or entities: explicit 
declaration or implementation or execution of functions or algorithms or methods for 
performing the structure mapping and/or effects mapping processes and potentially 
other processes; references to said functions, algorithms or methods, where the actual 

10 functions or algorithms or methods may be stored elsewhere, such as in a computer 
system memory or on a medium such as a hard disk or removable medium or even in 
another rule-set; possible relations or associations or attribute or parameter passing 
methods for controlling or specifying information-passing or dependencies between 
functions or algorithms or methods or even between rules or rule-sets; rules or rule-sets 

15 specifying methods of selecting which temporal structure mappings and effects 
mappings will be executed or implemented in any particular application of the 
embodiment; ordering and/or repetition information for the application of mappings or 
of rules or rule-sets; heuristics information or information derived heuristically for 
direction or control of any portion of said rule-set and related information. 

20 A collection of rules or rule-sets and any of the aforementioned properties or 

relations that may be included in or with a rule-set may be collectively described as a 
template. The act of application of the embodiment to input content, as previously 
described by means of application of temporal mapping or mappings and/or effects 
mapping or mappings to said input content and the associated relations or dependencies 

25 between these mappings, may be described as application of a template identically 
describing said application of said mappings and rules or rule-sets to input content to 
derive said output content. 

A further example of the first embodiment involves application of the template 
to multiple input content to achieve one output content. Various types of input content 
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may be accepted including audio, video, graphics, still images, clipart, animations, 
video keyframes or sequences thereof, mattes, live video source, for instance from a 
camcorder or DVCamcorder, or multiple sources of each of these, or combinations of 
these per input content source. In this example, the embodiment, for the purposes of the 
mapping processes, may treat each input content as a portion of the sum of the entire 
input content applied. Further, this embodiment of the invention may or may not have 
information about the relative sequence or chronology of some or all of the multiple 
input content sources, if this is relevant or practical. Several practical applications of 
this example exist, including a personal content display device, perhaps mounted as an 
electronic picture frame in which images or video and/or audio, etc may be displayed 
automatically by the embodiment. In this application, the input content may have been 
previously stored in a memory or on media such as a hard disk drive. Another 
practical application for this embodiment of the invention may be as an entertainment 
device for social occasions such as for parties, in which the embodiment may display 
output processed from multiple input content sources, possibly including live audio and 
video sources for public or social enjoyment. The input content may have been 
previously selected by the user to be suitable for the social occasion and the template or 
templates executed by the embodiment, including any sequencing or execution 
instructions pertainmg to the templates themselves, may also have been preselected by 
the user. 

Second Preferred Em bodiment of the Method 

The template definition made in the first embodiment may be extended to 
include the capability to store and convey and execute or direct the execution of a set of 
rules and associated information and content and functions, mappings and algorithms, 
or any combination or repetition or sequence thereof, where said rules and associated 
information, etc may have been created or defined or arranged by or created or 
authored under the direct or indirect control of an expert or experts or any person or 
persons skilled or experienced in the art or arts of multimedia presentation, movie- 
making, video production, audio production, or similar. The purpose of such a 
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template or templates is to convey and/or control the production or presentation or post- 
production of input content (single or multiple) provided by a user in order to deliver 
output content which may be perceived by a viewer as comparably positively improved 
or reduced of some negative aspects with respect to the unmodified input content (single 
5 or multiple). Such a template or templates may contain heuristics, expert systems, 
procedural rules, script or scripts, parameters, algorithms, functions, provided content 
including at least all types of input content previously described, or references to any of 
these, or even merely data parameters used to setup a machine or system equivalent to 
the embodiment. Said template or templates may also include information in the form 

10 of text, graphics, video, audio, etc capable of describing or approximately describing 
the action or intent of the template or of the template author to a user in order to allow 
said user the opportunity to make a selection of, or between, one or more templates. 

A practical purpose for said template or templates may include, for example, 
the processing of input content to create the appearance of a typical professionally-made 

15 video or movie from the content output by the embodiment. Similarly, it may be 
desirable to create a mood, or genre, or look, or other emotional or perceptible effect 
or feeling in content output by the embodiment from input content which does not 
include said mood, or genre or look, or other emotional or perceptible effect or feeling 
or not to the same degree, in the opinion of the user or the viewer, or both. Typically, 

20 post-production, or making of a video or movie requires team-work & a variety of 
skills for the capture process (acting, directing, script- writing, camera direction, etc) 
and for the post-production process (editing, directing, graphic-arts, music 
composition, etc). Typically this skill-set is unavailable to consumers and business 
people who may commonly use a camcorder, DVCamcorder, still-image camera, audio 

25 recorder, or the like. Portions of this team-work and skill-set may be compiled into a 
template form and made available to users of the embodiment in said template form. 
Since the embodiment assumes the input content has already been captured, therefore 
reducing or limiting the ability of the embodiment, under control of the template, to 
affect the capture process, then the skill-set contained or described or compiled within 
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the template is typically limited to controlling, directing or executing application of the 
embodiment to the post-capture process, as indicated in Fig. 1, wherein the said 
template or templates may replace or control or direct or execute or do any combination 
of these for the mapping processes 102 and 103. 
5 Fig. 5 indicates the second preferred embodiment of the invention in which 

said template or templates may be used to control or direct or execute processing of 
input content in a manner equivalent to or derivative of typical movie or video or audio 
or musical or multimedia post-production techniques in order to produce output content 
that may be perceived by a viewer as positively improved or reduced in negative 

10 aspects. Movie Director, 503, receives an input template or templates, 501, and input 
content (singular or plural), 502. In this preferred embodiment, input content, 502, 
will typically include synchronised, parallel or coincident video and audio content such 
as delivered by a camcorder or DVCamcorder device, or still images or graphical 
content, or niusic or other audio content. Input content, 502, will also typically include 

15 information about the input content, also known as metadata, that may specify some 
temporal structure information concerning the input content. In this second embodiment 
it is assumed that the said information about the temporal structure of the said input 
content is similar in type and quantity to that described in the first preferred 
embodiment. 

20 Movie Director, 503, analyses the rules and other elements contained within 

template(s) 501 and constructs a series of instructions, 504, suitable for interpretation 
and/or execution by movie builder 505. The series of instructions, 504, in the form of 
a render script, typically also containing aspects of an edit decision list (EDL), is 
compiled by the movie director, 503, to typically also include references to input 

25 content, 502, and also possibly references to content provided by the template(s), 501, 
and possibly also references to other elements or entities including functions, 
algorithms, and content elements such as audio, music images, etc, as directed by the 
template(s), 501. Typically, template(s) 501, will direct the movie director, 503, to 
select and describe one or mappings for structure or effects to be applied to the input 
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content, 502, or to said other provided or referenced content. Typically the 
template(s), 501, may have insufficient information for the movie director, 503, to 
resolve all references concerning the input content, 502. Typically, these unresolved 
references may be due to insufficient information to determine which of the input 
5 content is to be operated on by the embodiment, or the location of the input content, or 
similar issues. Movie Director, 503, may obtain sufficient information to resolve these 
issues by requesting or waiting for input by a user via a user interface, 507. Typical 
input at 507 may include selection of one or more input content items or selection of 
portions of content to be made available at 502 as input content, or selection of points 
10 of interest within content to be made available as metadata information to movie 
director 503. Movie director 503, using information sources, 501, 502, 507, outputs a 
render script, 504, with all references resolved within the system so that Movie 
Builder, 505, may find or execute or input the entirety of the referenced items without 
exception. 

15 Movie builder 505 may typically execute or obey render script 504 directly, as 

typically, movie director, 503, has been designed to output the render script, 504, in a 
format suitable for direct execution by movie builder 505. Movie builder 505 may read 
and execute render script contents, 504, in a series-parallel method, as is typically 
required for video and audio parallel rendering or post-production. Additionally, 

20 movie builder 505 may execute or obey render script 504 by any reasonable method 
that will yield the expected result, or the result intended by the author of template(s) 
501. Movie builder 505 may typically be or include a video and audio renderer such as 
Apple Quicktime 3.0 or equivalent. Movie builder 505 may also be or include a 
hardware renderer or a combined hardware and software renderer and may be capable 

25 of realtime operation if a particular application of the embodiment so requires it. It 
may be noted that a difference between the first embodiment and the second 
embodiment, as is visible when comparing Fig. 1 and Fig. 5 is that in the first 
embodiment in Fig. 1, the mapping processes may execute without first compilation 
and resolution of references, whereas in the second embodiment, the rendering 
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processes, which include the mapping processes of the first embodiment, may be 
executed following compilation of a render script derived from a template and 
following resolution of references. Movie builder 505 may typically include or provide 
any or all of the video or movie or multimedia or audio editing and effects and related 
5 functions, algorithms, etc for execution according to the method, order, sequence, etc 
instructed by the render script 504 and as intended or directed by the template 501. 
Movie builder 505 renders, edits, or otherwise modifies the input content, 502, and 
provided content (portion of 501) or other referenced content (possibly present in the 
system), according to the instructions, sequence, and referenced functions, etc included 

10 in render script 504 and outputs the completed production to, optionally, either or both 
of 506, a storage element, or 508, a movie player. The storage system, 506, may be 
used to store the production indefinitely, and may be a device including a camcorder, 
DVCamcorder, or hard disk drive, or removable medium or remote storage accessible 
via the internet or similar or equivalent or a combination of any of these wherein the 

15 output production may be partially stored on any or all of these or may be duplicated 
across any or all of these. Store 506 may optionally store the output production and 
does not restrict the possibility of the output production being played and displayed 
immediately by movie player 508 and display 509, nor does store 506 limit the 
possibility of movie builder 505 being capable of rendering in realtime and also playing 

20 the output production, in which case movie builder 505 and movie player 508 may be 
the identical component within the system of the embodiment. User interface 507 may 
also provide user control of movie builder 505 or of movie player 508, if desired, to 
allow control of features or functions such as starting of execution of movie builder 505 
or starting of playing by movie player 508, or stopping of either 505 or 508 or both. 

25 User interface 507 may also permit a user to specify the location of store 506, if it 
should be used or other related options. Movie player 508 may be or include Apple 
Quicktime 3.0 renderer or any other equivalent or similar movie player. 

A specific example of an application of the second embodiment described in 
Fig. 5 provides for the application of a Silent Movie template, via the system described 
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in Fig. 5, to a user's input content to produce output content that may be perceived by a 
viewer to be similar to or to evoke feelings of or impressions of the silent movie genre 
or style of movie or film. Said Silent Movie template includes rules or relations 
between separate mapping processes, said rules or relations being intended to group or 
5 direct the production of a particular perception or feeling within the content output from 
the system in Fig. 5. Said rules or relations may include passive relations or grouping 
of mapping processes by the method of including said processes within a single 
template and excluding other unwanted processes. Further, said relations between 
mapping processes, etc may be active, being rules or similar and capable of being 

10 executed or operated or decided during execution of the system in Fig. 5. 

Said Silent Movie template may include a typical set of template components 
listed in Table 5. There may be many ways to construct a template or to apply or order 
its components to achieve an equivalent result to that of the Silent Movie production 
and the example in Table 5 is not limiting on these many construction methods or 

15 options or orderings or applications. Said Silent Movie template example in Table 5 
may be considered as an example of passive relationships between template components 
to achieve an overall production and consequent perception, as previously described. 
Many of the components listed in Table 5 may alone typically elicit some perception of 
the Silent Movie genre, but the combination or sum of these elements being coincident 

20 in one template and their sum effect on the input content result in a consequently strong 
perceptual reference or allusion to the Silent Movie genre. 

Fig. 7 includes an example implementation of the Movie Director module, in 
pseudo-code. Fig. 8 includes an example implementation of the Movie Builder, also in 
pseudo-code. Fig. 9 includes an example template implementation, also in pseudo- 

25 code. The template in Fig. 9 has been designed to create a fast-paced, fast-cutting 
production with a fast-beat backing music track to give the impression of an action 
movie. When the example in Fig. 9 is compared with the previous Silent Movie genre 
template description the versatility of the invention may recognised. 
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Table 6 provides example associations between editing & effect techniques and 
template type, where each template type is intended to induce or suggest one or more 
moods or is intended for application to input content of a particular kind or relating to a 
particular event type. 

5 Templates need not be fixed, nor entirely previously authored. A template or 

templates may be modified through various means as part of the method of the 
embodiment, including: inclusion of user-preferences; direct or indirect user- 
modification; inclusion of information or inferences derived from information about 
input content; modification by or in conjunction with another template or templates, 

10 including template algebra (eg. combining elements or intent from several templates to 
create a hybrid template); modification or grouping or consolidation or composition by 
a meta-template; template customisation. 

The method of the embodiment may support a meta-template which is capable 
of acting on a collection of templates, including functions such as: selection of 

15 template(s) based on criteria such as information about input content, user preferences, 
etc; modification, grouping, consolidation or composition of a group of templates; 
customisation of one or more templates, typically under specific user control. 

The method of the embodiment, through design and provision of suitable 
templates, may be used to provide a presentation or album function when applied to, or 

20 operating on input content and/or content provided with a template. 
Third Preferred Embodiment of the Method 

The previous embodiments may be further enhanced or extended by the 
inclusion of user-interactivity or user input or user preferences or user setup or user 
history or any of these. It is especially preferred that templates include the capability 

25 for requesting user input or preferences, or of requesting such information or 
interaction directly or indirectly, and be able to include such information in decision- 
making processes such as die aforesaid rules, in order to determine, or conclude or 
infer and execute, apply or direct a production including at least some of the 
preferences or desires of the user. 
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At its simplest, user interaction or control may include selection of a template 
or templates and selection of input content (single or multiple) for application, 
execution or direction of the former to the latter to output a production. 

Of particular interest is the opportunity for the embodiment, and of the 
5 template(s) therein, to utilise or enquire of the user's potential knowledge of the input 
content to presume or infer the best application, execution or direction of the template 
to said input content. This user-interaction and presumption or inference may be 
implemented in a variety of methods, including the simultaneous implementation of 
several alternative methods. The kinds of knowledge of the input content that may be 

10 obtained include: user preference for, neutrality towards or dislike of one or more input 
content segments; user preference for or dislike of point or points within input content 
segments; user preference for, neutrality towards or dislike of smiilarly-labelled 
sections of input content, for instance, where a database of labelled or partially labelled 
input content may be accessible to the embodiment; user approval or disapproval of an 

15 output production or portion or portions thereof. 

Fig 6. indicates in 600, a possible method of obtaining knowledge of input 
content from a user. The user may be asked or prompted to indicate highlights or 
emotionally significant or otherwise important portion or portions of input content. 
One method of implementing this interaction is to allow the user to indicate a point of 

20 significance, 605, and the embodiment, typically through the application of rules within 
the template(s) may infer zones, 610 and 611, before and after the point of interest that 
may be selected and extracted for inclusion in the production and application by the 
template. Typically, the durations, 606, 607, of the zones of interest, 610, 611, around 
the indicated point, 605, may be determined or defined or calculated within the 

25 template, typically by authored heuristics indicating the required or desired extracted 
content length for any time-based position within the output production. 

Fig. 6. also indicates, 620, a possible method for a user to indicate approval or 
disapproval of portion or portions of the output production, 621. The user may 
indicate a point of approval or disapproval, 625, and this point information may be 
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inferred to indicate an entire segment of the output production, 630, said segment 
typically being extrapolated from said point by means of finding the nearest forward 
and backward content boundaries (transitions) or effects, or by applying a heuristically 
determined time-step forward and backward from 625 that typically relates to the 
5 temporal structure of the output production. 

User interaction may also permit direct or indirect alteration or selection of 
parameters or algorithms or rules to be utilised by the template(s) by means including: 
selection of numerical values for quantities such as clip duration, number of clips, etc; 
indirect selection of clip duration or temporal beat or number of clips through selection 

10 of a particular template with the desired characteristics or through indicating preference 
for the inclusion of a particular clip of a known duration, therefore potentially 
overriding template rules relating to selection of such content; selection from a set of 
style options offered by a template as being suitable (said suitability typically being 
determined heuristically or aesthetically and authored into said template); selection of a- 

15 method or methods, such as a clip selection method preferring to select content from a 
localised region of the input content. A template may provide means and especially 
rules for requesting all such information or options or preferences or for^ indirectly 
requesting said information or for allowing user-selection of said information. A 
template may not require said information but may assume defaults or heuristics, etc 

20 unless said information is offered, or directed by the user. The act by a user of 
selecting a template may define or determine some or all of said options or parameters 
or selections by means of defaults or default methods being within said template. 

A template may offer a user a hierarchical means or other equivalent or similar 
means for selecting, modifying or controlling or specifying parameters, methods, etc. 

25 Said hierarchical means of permitting or requesting user input may be implemented by 
initially requesting or offering generalisations for the production, for instance, the 
template selection may be the first generalisation, eg. selection from a wedding 
production template or a birthday production template. A next level of hierarchical 
selection may be the choice of a church wedding or a garden wedding production within 
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the template. Said choice may effect a change to styles and colours or music or related 
rules and methods within the template. A next level of hierarchical selection may be the 
choice of music or styles within a segment of the production relating to a particular 
input content segment. Thus, if the user is willing or interested or skilled enough to 
wish to specify detailed control of a production and the directing template then a 
hierarchical method may be appropriate for permitting such control where it is 
requested or required without demanding or enforcing the same level of detailed control 
for the entirety of a production where it may be unnecessary or undesirable. 

Further examples of user input to or control of or interaction with template 
rules include: choice of long, medium or short temporal structure mappings; choice of 
clip durations; choice of backing music; inputting text information into generated titles 
or dialogue mattes or credits or similar; selection of clipart or effects or styles from a 
range of options offered or referenced by a template or system. Some specific user 
control examples relating to the template examples already described include: optional 
chroma bleed-in at a selected point within the Silent Movie production to obtain the 
benefit of colour after the mood has first been set by the effects; textual input to 
dialogue mattes within the Silent Movie template example and also into the titles and 
end titles in the action template example. A further example of interaction with a user 
includes storyboard interaction in which the embodiment, and desirably a template(s) 
may include and display features of a storyboard, including images, representative 
images, icons, ammation, video, a script, audio, etc to convey properties and 
organisation of the template and/or production to a user to permit or enable easy 
comprehension, to assist or guide the user and to permit or enable easy modification, 
adjustment, editing, or other control of the embodiment and/or the output production by 
the user. 

User preferences, historical selections or choices, name, and other information 
may also be recalled from previous use and storage, etc to be utilised by template rules 
or as elements within a production (eg. as textual elements in titles, etc). Also, 
previously created productions, previously utilised or modified templates, previously 
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Utilised or selected input content, or groups of these may be recalled for subsequent use 
as input content or preferences or templates, etc. 
Fourth Preferred Embodim ent of the Method 

A fourth embodiment of the invention is also capable of using information 
5 about the input content in order to: select or adjust appropriate items or parameters to 
suit or match or fit the mood of the subject or other property of input content, be said 
input content any of audio, video, still-frames, animation, etc; advise, prompt or hint to 
the user a selection, choice, single option, alternative, parameter range, style, effect, 
structure, template, etc, for the user to operate on. 

10 This said capability of the embodiment to use information about the input 

content may be used, engaged or activated in conjunction with any of the previously 
described embodiments or examples of the invention in any reasonable combination of 
functions or features. 

The information about the input content may be obtained from an external 

15 source, such as a DVCamcorder, for instance. Canon model Optura, which is capable 
of performing some content analysis during recording of said content. Said -external 
source may provide said information, also described as metadata, from content analysis 
or a recording of results from an earlier content analysis operation, or metadata may be 
supplied from other information also available at the time of recording the content. 

20 Such other information may include lens and aperture settings, focus information, zoom 
information, and also white balance information may be available. Information made 
available by said external source may include motion information, for instance, the said 
DVCamcorder is capable of providing motion information as part of its image encoding 
method and this information may be extracted and used for other purposes such as those 

25 described for this embodiment. Further examples of other information that may also be 
available from the external input content source include: time, date or event 
information, for instance, information describing or referencing or able to be linked to 
a particular event on a particular day such as a sporting event; locality or geographical 
information, including GPS (Global Positioning System) information. 
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The embodiment may also be capable of analysing input content and providing 
its own information source or metadata source. Such analyses may be performed to 
obtain metadata including these types: audio amplitude; audio event (loud noises, etc); 
audio characterisation (eg. identifying laughter, voices, music, etc); image motion 
5 properties; image colour properties; image object detection (eg. face or body detection); 
inferred camera motion; scene changes; date, time of recording, etc; light levels; voice 
recognition; voice transcription; etc. 

The more that the embodiment is capable of inferring about the subject or 
action or other details of a scene or event recorded within input content then the more 

10 capable the embodiment may be to perform template functions, including: searching, 
selection and extraction of clips from input content, for instance, to find events of 
interest or appropriate to the mood of the applied template; to infer relationships 
between portions of input content and to maximise benefits from these through control 
of clip-adjacency, chronology, subject selection frequency, etc; to infer the subject of 

15 input content in order to select an appropriate template, or function or effect within a 
template; to select appropriate transition properties, eg. colour, speed, type, based on 
information about the input content such as colour, motion and light level. 

The embodiment may also include the capability to access and search input 
content by means of labels applied to or associated with or referencing the input 

20 content. Said labels may have been applied by the embodiment itself or by any other 
source. Said labels may be applied in patterns to label portions of input content 
according to any rule method required or appreciated by the user or by any source 
acting on the user's behalf or under the user's instructions. Thus, an input content 
section may contain labels describing specific subjects within the content, such as the 

25 user's family, and the embodiment may utilise these labels to select or avoid selecting 
said labelled portions based on instructions provided to the embodiment. Said 
instructions need not be provided directly by a user. For example, the user may select 
a template which has been previously defined, and is declared to the user through an 
appropriate mechanism, eg. the template name, to select for family snapshots or video 
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clips. Said family snapshot template may include reference to a labelling scheme which 
permits it to interpret the labels previously placed on or associated with the input 
content, therefore the embodiment need not require direct user action or control 
concerning the labelled input content. The embodiment may support the user's optional 
5 wish to override or modify the labels, thereby allowing control of the current 
production process and possibly of future production processes if permanent 
modifications are retained for the labels. 

Preferred E mbodiment of Apparatuses^ 

10 The multi-media editing processes are preferabfy practiced using a 

conventional general-purpose computer, such as the one shown in Fig. 10, wherein the 
processes of Figures 1 to 6 are implemented as software executing on the computer. In 
particular, the steps of the editing methods are effected by instructions in the software 
that are carried out by the computer. The software may be divided into two separate 

15 parts; one part for carrying out the editing methods; and another part to manage the 
user interface between the latter and the user. The software may be- stored in a 
computer readable medium, including the storage devices described below, for 
example. The software is loaded into the computer from the computer- readable 
medium, and then executed by the computer. A computer readable medium having 

20 such software or computer program recorded on it is a computer program product. The 
use of the computer program product in the computer preferably effects an 
advantageous apparatus for multi-media editing in accordance with the embodiments of 
the method of the invention. 

The computer system 1000 consists of the computer 1002, a video display 

25 1004, and input devices 1006, 1008. In addition, the computer system 1000 can have 
any of a number of other output devices including line printers, laser printers, plotters, 
and other reproduction devices connected to the computer 1002. The computer system 
1000 can be connected to one or more other computers via a communication interface 
1010 using an appropriate communication mechanism such as a modem 
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communications path, a computer network, or the like. The computer system 1000 can 
also be optionally connected to specialised devices such as rendering hardware or video 
accelerators 1032 by means of communication interface 1010. The computer network 
may include a local area network (LAN), a wide area network (WAN), an Intranet, 
5 and/or the Internet 

The computer 1002 itself consists of one or more central processing unit(s) 
(simply referred to as a processor hereinafter) 1014, a memory 1016 which may include 
random access memory (RAM) and read-only memory (ROM), input/output (lO) 
interfaces 1010, 1018, a video interface 1020, and one or more storage devices 

10 generally represented by a block 1022 in Fig. 10. The storage device(s) 1022 can 
consist of one or more of the following: a floppy disc, a hard disc drive, a magneto- 
optical disc drive, CD-ROM, magnetic tape or any other of a number of non- volatile 
storage devices well known to those skilled in the art. Each of the components 1020, 
1010, 1022, 1014, 1018, 1016 and 1028 is typically connected to one or more of the 

15 other devices via a bus 1024 that in turn can consist of data, address, and control buses. 

The video interface 1020 is connected to the video display 1004 and provides 
video signals from the computer 1002 for display on the video display 1004. User 
input to operate the computer 1002 is provided by one or more input devices. For 
example, an operator can use the keyboard 1006 and/or a pointing device such as the 

20 mouse 1008 to provide input to the computer 1002. 

The system 1000 is simply provided for illustrative purposes and other 
configurations can be employed without departing from the scope and spirit of the 
invention. Exemplary computers on which the embodiment can be practiced include 
IBM-PC/ ATs or compatibles, one of the Macintosh (TM) family of PCs, Sun 

25 Sparcstation (TM), or the like. The foregoing is merely exemplary of the types of 
computers with which the embodiments of the invention may be practiced. Typically, 
the processes of the embodiments, described hereinbefore, are resident as software or a 
program recorded on a hard disk drive (generally depicted as block 1026 in Fig. 10) as 
the computer readable medium, and read and controlled using the processor 1014. 
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Intermediate storage of the input and template data and any data fetched from the 
network may be accomplished using the semiconductor memory 1016, possibly in 
concert with the hard disk drive 1026. 

In some instances, the program may be supplied to the user encoded on a CD- 
ROM 1028 or a floppy disk 1030, or alternatively could be read by the user from the 
network via a modem device 1012 connected to the computer, for example. Still 
further, the software can also be loaded into the computer system 1000 from other 
computer readable medium including magnetic tape, a ROM or integrated circuit, a 
magneto-optical disk, a radio or infra-red transmission channel between the computer 
and another device, a computer readable card such as a PCMCIA card, and the Internet 
and Intranets including email transmissions and information recorded on websites and 
the like. The foregoing is merely exemplary of relevant computer readable mediums. 
Other computer readable mediums may be practiced without departing from the scope 
and spirit of the invention. 

The method of multimedia editing may alternatively be implemented in 
dedicated hardware such as one or more integrated circuits performing the functions or 
sub functions of the editing. Such dedicated hardware may include graphic processors, 
digital signal processors, or one or more microprocessors and associated memories. 

The foregoing only describes a small number of embodiments of the present 
invention, however, modifications and/or changes can be made thereto by a person 
skilled in the art without departing from the scope and spirit of the invention. The 
present embodiments are, therefore, to be considered in all respects to be illustrative 
and not restrictive. 
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ASPECTS OF THE INVENTION 



The following numbered paragraphs set forth aspects of the invention: 

5 1. A method for production and post-production processing of multi- 

media input data the method comprising the steps of: 

(a) inputting one or more multi-media input data sets; 

(b) inputting one or more templates; and 

(c) applying the one or more templates to the one or more input data sets 
10 so to produce a processed output data set for storage, and/or display and/or further 

processing. 

2. A method according to claim 1, whereby the one or more templates 
include a temporal structure mapping process and/or an effects mapping process, and 

15 whereby the applying of the one or more templates in step (c) comprises the sub-steps 
of: 

(d) applying the temporal stmcture mapping process to the input data sets 
so to produce a temporally structured data set; and 

(e) applying the effects mapping process to the temporally structured data 
20 set so to produce the processed output data set. 

3. A method according to claims 1 or 2 whereby the one or more 
templates include rules and/or references, and whereby the applying of the one or more 
templates in step (c) comprises the sub-steps of: 

25 (f) constructing and/or deriving a series of instructions; 

(g) accepting one or more user inputs if necessary to complete the series 
of instructions; and 

(h) applying the instructions in a series-parallel manner to the one or more 
input data sets so to produce the processed output data set. 
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Automatic Application of Derived Movie-making Techniques 
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Table 1 . 



Selection and Extraction Method Examples 


Selected Portion 
Start 


Extracted Duration 


Relationships 
between Selections 


Within whole 
content . 


Random . 


Random, 

chronological , 
without overlap. 


Within clip. 


Less than or equal 
to clip length. 


Random, chronology 
ignored, overlap 
ignored . 


Within a group of 
clips . 


Spanning one or 
more clips 
recorded within 
the same day. 


From separate clips. 


Heuristically 
obtained, eg. 
assume zones of 
interest in 
recorded content 
occur primarily 
near clip 
startpoints . 


Heuristically 
obtained, eg. 
related to human 
attention span. 


From a group of 
clips recorded 
within the same day. 


Multi-pass 
(repetitious ) 


Limited so as to 
limit total output 
duration (eg. 
based on 
heuristics ) . 


From all clips 
within whole 
content . 




Short durations 
followed by longer 


Repetitious, for 
instance, to 
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durations (eg. 


lengthen output 




applied to 


content duration 
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with respect input 




selection ) 


content duration. 
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Table 2 . 

Ordering Method Examples 

Sequential or chronological 

Random 

Reverse -chronological 

Flashback (later chronologies displayed or 

duplicated early in the order) 

Montage (later chronologies displayed in brief 

early in the order) . f_ 

Cutaway (two related or consecutive portions 
separated by an unrelated or distant portion) 
Alternate 
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Table 3. 

Assembly Method Examples 

Cut (butt-edit) 

Short Dissolve 

Long Dissolve 

Fast Wipe 

Slow Wipe 

Graphic 
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Table 4 , 

Effects Mapping Examples 

Addition of Sound effect 

Removal of chrominance 

Addition of artificial scratches and dust 
Composition or overlay of sprites, animation, 

graphics 

Addition of Music 

Luminance or chrominance keying or matteing t 
Dissolve or mixing of other content 
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Table 5. 



Silent Movie Template Components Example 


Component 


Purpose 


Four well -separated random 
video selections from input 
content . 


Selection of sufficiently 
differing activities or 
incidents from the input 
content to create surprise or' 
reduce boredom. 


Extract limited duration 
clips for each selection, 
each preferably less than 2 
minutes in duration . 


Limit clip duration to the 
effective viewer attention 
span and avoid boredom. 


Filter clips to remove all 
chrominance information . 


Replicate black and white" 
characteristic of Silent 
Movie qenre . 


Remove original audio 
information . 


Replicate silent 
characteristic of Silent 
Movie qenre . 


Add piano soundtrack . 


Replicate characteristic of 
Silent Movie qenre. 


JInsert dialogue mattes at 
clip boundaries . 


Replicate characteristic of 
Silent Movie genre. 


A^PPly scratch and dust 
filter. 


Replicate characteristic of 
Silent Movie qenre. 


Cut in titles, dialogue 
mattes and clips . 


Replicate hard -cut 
characteristic of Silent 
Movie qenre . 


Insert fade-in from black to 


Include title in 
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/3/20 



title dialogue matte. 


characteristic style of 
Silent Movie genre. 


Insert fade-out to black from 
end- title dialogue matte. 


Include end-title in 
characteristic style of 
oixenu iMovre genre. 


Insert film projector 
sprocket hole sound over 
title. 


Replicate projector sound- 
effect characteristic of 
Silent Movie genre. 
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Table 6 . 



Example Associations between Editing & Effect Techniques and Template Type 




Romance 
Montage 


Action 
Montage 


Continuity 
Template 


Silent Movie 












Transitions 


Fade 










Fade out 










Fade in 








y 


Dissolve 










Cross-fades 










Wipe 










Quicl^hip 










Audio 




















Sound Types 


Actual Sound 










Sound effects 








y 


Atmos sound 








y 


Voice over 




















Cuts 


Cross cut 










Continuity cut 


-/ 


-/ 




y 


Compilation cuts 










Split editing 










Parallel cutting 










Classical cutting 








y 












Editing effects 


Cutaways 


-/ 








Insert 




■/ 






Subliminal cuts 










Rashbacks 






■/ 




Freeze-frames 


V 








Frequency 










Duration 










Montages 










Rhythm 










Reverse shot 




















Shot length 


Same length 










Slow cutting 










Fast cutting 










Cut to beat/music 
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Fig . 7 

Movie Director Example Implementation (Pseudo-code) 
main( ) 
begin 

5 create rule list 

create parameter list 

create item list 

create rule syntax table 

10 get template file name 

load ( template_f ile_name ) 



15 



20 



get render script file name 
create render script file 

get input content file names 
create content list 

contentparse(content_list , input_content_f ile_names , 



ruleparse ( installed_rules , content_list / render_scr ipt_f ile ) 
save render_script_f ile 
close render_script_f ile 
exit 

end 

25 

load ( template_f ile_name ) 
begin 

while (not end of template_f ile ) 
begin 

30 get next item 

if ( item_type == reference) 

resolve ( item ) 
else if (item_type == rule) 
install rule_name 
35 else if ( item_type == parameter) 

write ( parameter_list , parameter_name ) 
else if ( item_type == rule_syntax_extension ) 

write ( rule_syntax_table, rule_syntax_extension ) 

else 

40 write ( item_l is t , item_name) 

end 

end 



resolve ( ref erence_name ) 
45 begin 

if (ref erence_type provided„content ) 
begin 

get provided content file name 
50 contentparse(content_list, 

provided„content_f ile_name) 

end 
else 

get referenced item 

55 end 
end 



contentparse ( content_list , content_f ile__name , ... ) 
60 begin 

while (not last content item) 
begin 

if (content_f ile_narae_type directory) 
begin 
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get directory contents 
contentparse ( content_list , 

directory_content_f ile_names , ...) 

end 

6 else 

begin 

get content information 

write ( content_list , content_f ile_name, 
content_inf ormation ) 

10 end 

end 

end 

15 ruleparse (rule_list , content_list , render_script_f ile_name ) 
begin 

create instruction list 
while (not last rule) 
begin 

20 get rule 

decode ( instructions, operands, rule, content_r_ef erences , 

paraineter_ref erences , 
item_ref erences ) 

end 

25 

get instruction list 

while (not last instruction) 

begin 

execute instruction ( operands ) 

30 end 
end 

decode ( instructions , operands, rule, content_ref erences , 
35 parameter_ref erences , item_ref erences ) 

begin 

while (not end of rule) 
begin 

get next portion 
40 if (portion_type == instruction) 

begin 

read (portion) 

convert portion according to rul^ syntax table 
write ( instruction__list , instruction ) 
45 end 

else 

begin 

read ( reference ) 

convert portion according to rule syntax table 
50 write( instruction_list , operand) 

end 

end 

end 
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Fig . 8 

Movie Builder Example Implementation (Pseudo-code) 

main ( ) 
begin 

5 get render script file name 

get destination movie file name 
open render script file 
create qt„movie_f ile 

parse ( render_script_file, qt_movie_f ile ) 
10 close render__script_f ile 

save qt_movie_f ile 
close qt_movie_f ile 
exit 



15 



45 



60 



end 



parse ( scr ipt_f ile_name , qt_movie_f ile ) 
begin 

while(not end of script_file) 
begin 

20 get next script file line 

parse_line ( script_f ile_line, qt_movie_f ile ) 

end 

end 

25 parse_l ine ( scr ipt_f ile_line , qt_movie_f ile ) 
begin 

get first word of line 
if "//" return 
else if "video" then 
30 video ( script_file_line, qt_movie_f ile ) 

else if "audio" then 

audio ( script_file_line, qt_movie_f ile ) 
else if "transition" then 

transition ( script_file_line, qt_movie_f ile ) 

35 else 

flag error in script file 

end 

video ( scr ipt_f ile_line , qt_movie_f ile ) 
40 begin 

parse video paramenters 

add video to qt_movie_f ile using QT API 

end 



audio ( script_f ile_line , qt_movie_f ile ) 
begin 

parse audio paramenters 

add audio to qt_movie_f ile using QT API 

end 



trans ition( scr ipt„file_line, qt_movie_f ile) 
begin 

parse transition paramenters 

add transition to qt_movie_f ile using QT API 

55 end 
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Fig . 9 

Template Example Implementation ( Pseudo-code) 



10 



15 



20 



25 



30 



Fast-paced, quick cutting, fast beat, 
//chronology not strictly enforced 
//repetitive temporal structure 
//one long clip may contribute 2 elements 

//do not use first/last second of clip 



crossfade //3-4 frame crossfade 

//sync video clip lengths to music beat 
//specify backing music characteristics 
//remove all original audio 



//Action Template 
cut_order = chronological 
structure ^ 10s, 4s, ... 
intraclip„cutting = 2 
intraclip_spacing = 2s 
avoid_cutting = Is, -Is 
cut_method = random, clip 
play_order = forward 
structure_transition = 3, 
beat_synchronise = true 
back_track = action 
audio_action = mute_all 
title = action_title 
end_title = action_end_title 

//function definition 

length check_f it ( content-length, structure, intraclip_spacing, 
intraclip_cutting, avoid_cutting ) 

begin 

length = content_length - avoid_cutting [ 0 ] + avoid_cutting [ 1 ] 

- structure [0] 

X = intraclip_cutting 

while (X > 1) 

begin 

X = X - 1 

length = length - structure [x] - intraclip_spacing [x] 

end 

return length 

end 
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n/to 

main() //start 
begin 

trim (title, beat_synchronise, structure_transition ) 
assemble_edit (output, title, play_order, structure_transition , 
5 audio^action, back_track) 

while not (completed content list) 
begin 

get next content (cut_order) 

excess ^ check^fit ( content_length, structure, 
10 intraclip_cutting, intraclip_spacing , 

avoid_cutting ) 

if (excess > 0) 
begin 

y = 0 

15 cut_start = cu.t_method ( excess ) 

cut„end = 0 

while (y < intraclip_cutting ) 
begin 

cut_end = cut ( avoid_cutting + cut_start + 
20 cut_end, structure [y] ) 

y = y + 1 

cut_start = excess - cut_start 

end 

end 

25 

trim ( current_clips , beat_synchronise, 

structure__transition) 
assemble_edit (output, current_clips , play_order, 
structure_transition , audio_action , 
30 back_track) 
end 

trim (end_title, beat_synchronise, structure_transition ) 
assemble_edit (output, end^title, play_order, 

structure_transition, audio_action, back_track) 

35 end //finish 
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